Overview
Brought to you by YData
Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 2000000 |
| Missing cells | 2806671 |
| Missing cells (%) | 6.7% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.6 GiB |
| Average record size in memory | 845.0 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 10 |
| DateTime | 1 |
| Boolean | 1 |
Age has 31194 (1.6%) missing values | Missing |
Annual Income has 74809 (3.7%) missing values | Missing |
Marital Status has 30865 (1.5%) missing values | Missing |
Number of Dependents has 182802 (9.1%) missing values | Missing |
Occupation has 597200 (29.9%) missing values | Missing |
Health Score has 123525 (6.2%) missing values | Missing |
Previous Claims has 606831 (30.3%) missing values | Missing |
Credit Score has 229333 (11.5%) missing values | Missing |
Customer Feedback has 130100 (6.5%) missing values | Missing |
Premium Amount has 800000 (40.0%) missing values | Missing |
id is uniformly distributed | Uniform |
id has unique values | Unique |
Previous Claims has 508239 (25.4%) zeros | Zeros |
Vehicle Age has 102232 (5.1%) zeros | Zeros |
Reproduction
| Analysis started | 2025-01-02 14:58:54.137025 |
|---|---|
| Analysis finished | 2025-01-02 15:02:02.040429 |
| Duration | 3 minutes and 7.9 seconds |
| Software version | ydata-profiling vv4.12.0 |
| Download configuration | config.json |
Variables
id
Real number (ℝ)
Uniform  Unique 
| Distinct | 2000000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 999999.5 |
| Minimum | 0 |
|---|---|
| Maximum | 1999999 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 130.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 99999.95 |
| Q1 | 499999.75 |
| median | 999999.5 |
| Q3 | 1499999.2 |
| 95-th percentile | 1899999 |
| Maximum | 1999999 |
| Range | 1999999 |
| Interquartile range (IQR) | 999999.5 |
Descriptive statistics
| Standard deviation | 577350.41 |
|---|---|
| Coefficient of variation (CV) | 0.5773507 |
| Kurtosis | -1.2 |
| Mean | 999999.5 |
| Median Absolute Deviation (MAD) | 500000 |
| Skewness | -6.6744919 × 10-17 |
| Sum | 1.999999 × 1012 |
| Variance | 3.333335 × 1011 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 1999983 | 1 | < 0.1% |
| 1999982 | 1 | < 0.1% |
| 1999981 | 1 | < 0.1% |
| 1999980 | 1 | < 0.1% |
| 1999979 | 1 | < 0.1% |
| 1999978 | 1 | < 0.1% |
| 1999977 | 1 | < 0.1% |
| 1999976 | 1 | < 0.1% |
| 1999975 | 1 | < 0.1% |
| 1999974 | 1 | < 0.1% |
| Other values (1999990) | 1999990 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 1999999 | 1 | |
| 1999998 | 1 | |
| 1999997 | 1 | |
| 1999996 | 1 | |
| 1999995 | 1 | |
| 1999994 | 1 | |
| 1999993 | 1 | |
| 1999992 | 1 | |
| 1999991 | 1 | |
| 1999990 | 1 |
Age
Real number (ℝ)
Missing 
| Distinct | 47 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 31194 |
| Missing (%) | 1.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41.141914 |
| Minimum | 18 |
|---|---|
| Maximum | 64 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 130.7 MiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 20 |
| Q1 | 30 |
| median | 41 |
| Q3 | 53 |
| 95-th percentile | 62 |
| Maximum | 64 |
| Range | 46 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 13.539099 |
|---|---|
| Coefficient of variation (CV) | 0.32908286 |
| Kurtosis | -1.1947603 |
| Mean | 41.141914 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -0.011542001 |
| Sum | 81000447 |
| Variance | 183.3072 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 53 | 43923 | 2.2% |
| 61 | 43804 | 2.2% |
| 64 | 43453 | 2.2% |
| 39 | 43397 | 2.2% |
| 43 | 43263 | 2.2% |
| 57 | 43214 | 2.2% |
| 33 | 43162 | 2.2% |
| 62 | 42919 | 2.1% |
| 46 | 42911 | 2.1% |
| 47 | 42852 | 2.1% |
| Other values (37) | 1535908 |
| Value | Count | Frequency (%) |
| 18 | 40639 | |
| 19 | 41299 | |
| 20 | 41896 | |
| 21 | 41446 | |
| 22 | 41877 | |
| 23 | 38791 | |
| 24 | 41001 | |
| 25 | 40373 | |
| 26 | 41348 | |
| 27 | 40635 |
| Value | Count | Frequency (%) |
| 64 | 43453 | |
| 63 | 40538 | |
| 62 | 42919 | |
| 61 | 43804 | |
| 60 | 41009 | |
| 59 | 41794 | |
| 58 | 42466 | |
| 57 | 43214 | |
| 56 | 42391 | |
| 55 | 41920 |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.99634 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Female |
|---|---|
| 2nd row | Female |
| 3rd row | Male |
| 4th row | Male |
| 5th row | Male |
Common Values
| Value | Count | Frequency (%) |
| Male | 1003660 | |
| Female | 996340 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| male | 1003660 | |
| female | 996340 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2996340 | |
| a | 2000000 | |
| l | 2000000 | |
| M | 1003660 | 10.0% |
| F | 996340 | 10.0% |
| m | 996340 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9992680 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 2996340 | |
| a | 2000000 | |
| l | 2000000 | |
| M | 1003660 | 10.0% |
| F | 996340 | 10.0% |
| m | 996340 | 10.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9992680 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 2996340 | |
| a | 2000000 | |
| l | 2000000 | |
| M | 1003660 | 10.0% |
| F | 996340 | 10.0% |
| m | 996340 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9992680 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 2996340 | |
| a | 2000000 | |
| l | 2000000 | |
| M | 1003660 | 10.0% |
| F | 996340 | 10.0% |
| m | 996340 | 10.0% |
Annual Income
Real number (ℝ)
Missing 
| Distinct | 97540 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 74809 |
| Missing (%) | 3.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32768.681 |
| Minimum | 1 |
|---|---|
| Maximum | 149997 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 130.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1207 |
| Q1 | 8021 |
| median | 23957 |
| Q3 | 44641 |
| 95-th percentile | 104563 |
| Maximum | 149997 |
| Range | 149996 |
| Interquartile range (IQR) | 36620 |
Descriptive statistics
| Standard deviation | 32188.136 |
|---|---|
| Coefficient of variation (CV) | 0.98228354 |
| Kurtosis | 1.785627 |
| Mean | 32768.681 |
| Median Absolute Deviation (MAD) | 17217 |
| Skewness | 1.4680117 |
| Sum | 6.308597 × 1010 |
| Variance | 1.0360761 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7073 | 1751 | 0.1% |
| 16054 | 1698 | 0.1% |
| 24897 | 1572 | 0.1% |
| 14094 | 1513 | 0.1% |
| 15983 | 1464 | 0.1% |
| 7991 | 1429 | 0.1% |
| 13982 | 1425 | 0.1% |
| 16076 | 1394 | 0.1% |
| 16891 | 1304 | 0.1% |
| 17091 | 1271 | 0.1% |
| Other values (97530) | 1910370 | |
| (Missing) | 74809 | 3.7% |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 2 | 9 | |
| 3 | 8 | |
| 5 | 7 | |
| 7 | 3 | < 0.1% |
| 8 | 5 | < 0.1% |
| 10 | 4 | < 0.1% |
| 11 | 17 | |
| 12 | 1 | < 0.1% |
| 13 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 149997 | 6 | < 0.1% |
| 149996 | 32 | |
| 149995 | 8 | < 0.1% |
| 149994 | 4 | < 0.1% |
| 149993 | 6 | < 0.1% |
| 149992 | 13 | |
| 149991 | 5 | < 0.1% |
| 149990 | 8 | < 0.1% |
| 149989 | 3 | < 0.1% |
| 149987 | 5 | < 0.1% |
Marital Status
Categorical
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 30865 |
| Missing (%) | 1.5% |
| Memory size | 237.3 MiB |
| Single | |
|---|---|
| Married | |
| Divorced |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.997184 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Married |
|---|---|
| 2nd row | Divorced |
| 3rd row | Divorced |
| 4th row | Married |
| 5th row | Single |
Common Values
| Value | Count | Frequency (%) |
| Single | 659096 | |
| Married | 656488 | |
| Divorced | 653551 | |
| (Missing) | 30865 | 1.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| single | 659096 | |
| married | 656488 | |
| divorced | 653551 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1969135 | |
| e | 1969135 | |
| r | 1966527 | |
| d | 1310039 | |
| S | 659096 | 4.8% |
| n | 659096 | 4.8% |
| l | 659096 | 4.8% |
| g | 659096 | 4.8% |
| a | 656488 | 4.8% |
| M | 656488 | 4.8% |
| Other values (4) | 2614204 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 13778400 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 1969135 | |
| e | 1969135 | |
| r | 1966527 | |
| d | 1310039 | |
| S | 659096 | 4.8% |
| n | 659096 | 4.8% |
| l | 659096 | 4.8% |
| g | 659096 | 4.8% |
| a | 656488 | 4.8% |
| M | 656488 | 4.8% |
| Other values (4) | 2614204 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 13778400 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 1969135 | |
| e | 1969135 | |
| r | 1966527 | |
| d | 1310039 | |
| S | 659096 | 4.8% |
| n | 659096 | 4.8% |
| l | 659096 | 4.8% |
| g | 659096 | 4.8% |
| a | 656488 | 4.8% |
| M | 656488 | 4.8% |
| Other values (4) | 2614204 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 13778400 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 1969135 | |
| e | 1969135 | |
| r | 1966527 | |
| d | 1310039 | |
| S | 659096 | 4.8% |
| n | 659096 | 4.8% |
| l | 659096 | 4.8% |
| g | 659096 | 4.8% |
| a | 656488 | 4.8% |
| M | 656488 | 4.8% |
| Other values (4) | 2614204 |
Number of Dependents
Categorical
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 182802 |
| Missing (%) | 9.1% |
| Memory size | 229.2 MiB |
| 3.0 | |
|---|---|
| 4.0 | |
| 0.0 | |
| 2.0 | |
| 1.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 3.0 |
| 3rd row | 3.0 |
| 4th row | 2.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 3.0 | 369220 | |
| 4.0 | 366608 | |
| 0.0 | 362926 | |
| 2.0 | 359478 | |
| 1.0 | 358966 | |
| (Missing) | 182802 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3.0 | 369220 | |
| 4.0 | 366608 | |
| 0.0 | 362926 | |
| 2.0 | 359478 | |
| 1.0 | 358966 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2180124 | |
| . | 1817198 | |
| 3 | 369220 | 6.8% |
| 4 | 366608 | 6.7% |
| 2 | 359478 | 6.6% |
| 1 | 358966 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5451594 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2180124 | |
| . | 1817198 | |
| 3 | 369220 | 6.8% |
| 4 | 366608 | 6.7% |
| 2 | 359478 | 6.6% |
| 1 | 358966 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5451594 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2180124 | |
| . | 1817198 | |
| 3 | 369220 | 6.8% |
| 4 | 366608 | 6.7% |
| 2 | 359478 | 6.6% |
| 1 | 358966 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5451594 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2180124 | |
| . | 1817198 | |
| 3 | 369220 | 6.8% |
| 4 | 366608 | 6.7% |
| 2 | 359478 | 6.6% |
| 1 | 358966 | 6.6% |
Education Level
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 239.4 MiB |
| Master's | |
|---|---|
| PhD | |
| Bachelor's | |
| High School |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 7.9638165 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Bachelor's |
|---|---|
| 2nd row | Master's |
| 3rd row | High School |
| 4th row | Bachelor's |
| 5th row | Bachelor's |
Common Values
| Value | Count | Frequency (%) |
| Master's | 506370 | |
| PhD | 505975 | |
| Bachelor's | 505457 | |
| High School | 482198 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| master's | 506370 | |
| phd | 505975 | |
| bachelor's | 505457 | |
| high | 482198 | |
| school | 482198 |
Most occurring characters
| Value | Count | Frequency (%) |
| h | 1975828 | |
| s | 1518197 | 9.5% |
| o | 1469853 | 9.2% |
| r | 1011827 | 6.4% |
| a | 1011827 | 6.4% |
| ' | 1011827 | 6.4% |
| e | 1011827 | 6.4% |
| l | 987655 | 6.2% |
| c | 987655 | 6.2% |
| M | 506370 | 3.2% |
| Other values (9) | 4434767 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 15927633 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| h | 1975828 | |
| s | 1518197 | 9.5% |
| o | 1469853 | 9.2% |
| r | 1011827 | 6.4% |
| a | 1011827 | 6.4% |
| ' | 1011827 | 6.4% |
| e | 1011827 | 6.4% |
| l | 987655 | 6.2% |
| c | 987655 | 6.2% |
| M | 506370 | 3.2% |
| Other values (9) | 4434767 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 15927633 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| h | 1975828 | |
| s | 1518197 | 9.5% |
| o | 1469853 | 9.2% |
| r | 1011827 | 6.4% |
| a | 1011827 | 6.4% |
| ' | 1011827 | 6.4% |
| e | 1011827 | 6.4% |
| l | 987655 | 6.2% |
| c | 987655 | 6.2% |
| M | 506370 | 3.2% |
| Other values (9) | 4434767 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 15927633 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| h | 1975828 | |
| s | 1518197 | 9.5% |
| o | 1469853 | 9.2% |
| r | 1011827 | 6.4% |
| a | 1011827 | 6.4% |
| ' | 1011827 | 6.4% |
| e | 1011827 | 6.4% |
| l | 987655 | 6.2% |
| c | 987655 | 6.2% |
| M | 506370 | 3.2% |
| Other values (9) | 4434767 |
Occupation
Categorical
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 597200 |
| Missing (%) | 29.9% |
| Memory size | 237.4 MiB |
| Employed | |
|---|---|
| Self-Employed | |
| Unemployed |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 10.334517 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Self-Employed |
|---|---|
| 2nd row | Self-Employed |
| 3rd row | Self-Employed |
| 4th row | Employed |
| 5th row | Employed |
Common Values
| Value | Count | Frequency (%) |
| Employed | 471324 | |
| Self-Employed | 470636 | |
| Unemployed | 460840 | |
| (Missing) | 597200 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| employed | 471324 | |
| self-employed | 470636 | |
| unemployed | 460840 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2334276 | |
| l | 1873436 | |
| p | 1402800 | |
| o | 1402800 | |
| m | 1402800 | |
| d | 1402800 | |
| y | 1402800 | |
| E | 941960 | |
| S | 470636 | 3.2% |
| f | 470636 | 3.2% |
| Other values (3) | 1392316 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 14497260 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 2334276 | |
| l | 1873436 | |
| p | 1402800 | |
| o | 1402800 | |
| m | 1402800 | |
| d | 1402800 | |
| y | 1402800 | |
| E | 941960 | |
| S | 470636 | 3.2% |
| f | 470636 | 3.2% |
| Other values (3) | 1392316 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 14497260 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 2334276 | |
| l | 1873436 | |
| p | 1402800 | |
| o | 1402800 | |
| m | 1402800 | |
| d | 1402800 | |
| y | 1402800 | |
| E | 941960 | |
| S | 470636 | 3.2% |
| f | 470636 | 3.2% |
| Other values (3) | 1392316 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 14497260 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 2334276 | |
| l | 1873436 | |
| p | 1402800 | |
| o | 1402800 | |
| m | 1402800 | |
| d | 1402800 | |
| y | 1402800 | |
| E | 941960 | |
| S | 470636 | 3.2% |
| f | 470636 | 3.2% |
| Other values (3) | 1392316 |
Health Score
Real number (ℝ)
Missing 
| Distinct | 811360 |
|---|---|
| Distinct (%) | 43.2% |
| Missing | 123525 |
| Missing (%) | 6.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.613559 |
| Minimum | 1.6465608 |
|---|---|
| Maximum | 58.975914 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 130.7 MiB |
Quantile statistics
| Minimum | 1.6465608 |
|---|---|
| 5-th percentile | 7.2914169 |
| Q1 | 15.918658 |
| median | 24.579581 |
| Q3 | 34.52391 |
| 95-th percentile | 47.614835 |
| Maximum | 58.975914 |
| Range | 57.329353 |
| Interquartile range (IQR) | 18.605252 |
Descriptive statistics
| Standard deviation | 12.204827 |
|---|---|
| Coefficient of variation (CV) | 0.47649867 |
| Kurtosis | -0.7850883 |
| Mean | 25.613559 |
| Median Absolute Deviation (MAD) | 9.2384519 |
| Skewness | 0.28239675 |
| Sum | 48063203 |
| Variance | 148.9578 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19.92724142 | 207 | < 0.1% |
| 19.8697009 | 202 | < 0.1% |
| 22.95540237 | 182 | < 0.1% |
| 25.90765016 | 182 | < 0.1% |
| 20.63784183 | 158 | < 0.1% |
| 27.8450064 | 156 | < 0.1% |
| 23.95570971 | 154 | < 0.1% |
| 10.9581528 | 151 | < 0.1% |
| 27.9294023 | 151 | < 0.1% |
| 24.85813464 | 144 | < 0.1% |
| Other values (811350) | 1874788 | |
| (Missing) | 123525 | 6.2% |
| Value | Count | Frequency (%) |
| 1.646560764 | 1 | < 0.1% |
| 2.012237182 | 1 | < 0.1% |
| 2.024415229 | 3 | |
| 2.036747412 | 1 | < 0.1% |
| 2.039338266 | 1 | < 0.1% |
| 2.039744021 | 1 | < 0.1% |
| 2.050052716 | 1 | < 0.1% |
| 2.053457869 | 1 | < 0.1% |
| 2.056558808 | 2 | |
| 2.060175622 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 58.97591405 | 1 | |
| 58.88603451 | 1 | |
| 58.5696892 | 1 | |
| 58.4524782 | 1 | |
| 58.40100949 | 1 | |
| 57.98884782 | 1 | |
| 57.95735079 | 1 | |
| 57.92381001 | 1 | |
| 57.90318089 | 1 | |
| 57.85252539 | 1 |
Location
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 235.6 MiB |
| Suburban | |
|---|---|
| Rural | |
| Urban |
Length
| Max length | 8 |
|---|---|
| Median length | 5 |
| Mean length | 6.003098 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Urban |
|---|---|
| 2nd row | Rural |
| 3rd row | Suburban |
| 4th row | Rural |
| 5th row | Rural |
Common Values
| Value | Count | Frequency (%) |
| Suburban | 668732 | |
| Rural | 668067 | |
| Urban | 663201 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| suburban | 668732 | |
| rural | 668067 | |
| urban | 663201 |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 2005531 | |
| b | 2000665 | |
| r | 2000000 | |
| a | 2000000 | |
| n | 1331933 | |
| S | 668732 | 5.6% |
| R | 668067 | 5.6% |
| l | 668067 | 5.6% |
| U | 663201 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12006196 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| u | 2005531 | |
| b | 2000665 | |
| r | 2000000 | |
| a | 2000000 | |
| n | 1331933 | |
| S | 668732 | 5.6% |
| R | 668067 | 5.6% |
| l | 668067 | 5.6% |
| U | 663201 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12006196 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| u | 2005531 | |
| b | 2000665 | |
| r | 2000000 | |
| a | 2000000 | |
| n | 1331933 | |
| S | 668732 | 5.6% |
| R | 668067 | 5.6% |
| l | 668067 | 5.6% |
| U | 663201 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12006196 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| u | 2005531 | |
| b | 2000665 | |
| r | 2000000 | |
| a | 2000000 | |
| n | 1331933 | |
| S | 668732 | 5.6% |
| R | 668067 | 5.6% |
| l | 668067 | 5.6% |
| U | 663201 | 5.5% |
Policy Type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 240.1 MiB |
| Premium | |
|---|---|
| Comprehensive | |
| Basic |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 8.332763 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Premium |
|---|---|
| 2nd row | Comprehensive |
| 3rd row | Premium |
| 4th row | Basic |
| 5th row | Premium |
Common Values
| Value | Count | Frequency (%) |
| Premium | 669475 | |
| Comprehensive | 665822 | |
| Basic | 664703 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| premium | 669475 | |
| comprehensive | 665822 | |
| basic | 664703 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2666941 | |
| m | 2004772 | |
| i | 2000000 | |
| r | 1335297 | 8.0% |
| s | 1330525 | 8.0% |
| P | 669475 | 4.0% |
| u | 669475 | 4.0% |
| C | 665822 | 4.0% |
| o | 665822 | 4.0% |
| p | 665822 | 4.0% |
| Other values (6) | 3991575 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 16665526 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 2666941 | |
| m | 2004772 | |
| i | 2000000 | |
| r | 1335297 | 8.0% |
| s | 1330525 | 8.0% |
| P | 669475 | 4.0% |
| u | 669475 | 4.0% |
| C | 665822 | 4.0% |
| o | 665822 | 4.0% |
| p | 665822 | 4.0% |
| Other values (6) | 3991575 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 16665526 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 2666941 | |
| m | 2004772 | |
| i | 2000000 | |
| r | 1335297 | 8.0% |
| s | 1330525 | 8.0% |
| P | 669475 | 4.0% |
| u | 669475 | 4.0% |
| C | 665822 | 4.0% |
| o | 665822 | 4.0% |
| p | 665822 | 4.0% |
| Other values (6) | 3991575 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 16665526 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 2666941 | |
| m | 2004772 | |
| i | 2000000 | |
| r | 1335297 | 8.0% |
| s | 1330525 | 8.0% |
| P | 669475 | 4.0% |
| u | 669475 | 4.0% |
| C | 665822 | 4.0% |
| o | 665822 | 4.0% |
| p | 665822 | 4.0% |
| Other values (6) | 3991575 |
Previous Claims
Real number (ℝ)
Missing  Zeros 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 606831 |
| Missing (%) | 30.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0035624 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 508239 |
| Zeros (%) | 25.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 130.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 0.98282561 |
|---|---|
| Coefficient of variation (CV) | 0.97933684 |
| Kurtosis | 0.75296634 |
| Mean | 1.0035624 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.90556377 |
| Sum | 1398132 |
| Variance | 0.96594619 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 508239 | |
| 1 | 501692 | |
| 2 | 279761 | |
| 3 | 81764 | 4.1% |
| 4 | 17689 | 0.9% |
| 5 | 3411 | 0.2% |
| 6 | 506 | < 0.1% |
| 7 | 86 | < 0.1% |
| 8 | 12 | < 0.1% |
| 9 | 9 | < 0.1% |
| (Missing) | 606831 |
| Value | Count | Frequency (%) |
| 0 | 508239 | |
| 1 | 501692 | |
| 2 | 279761 | |
| 3 | 81764 | 4.1% |
| 4 | 17689 | 0.9% |
| 5 | 3411 | 0.2% |
| 6 | 506 | < 0.1% |
| 7 | 86 | < 0.1% |
| 8 | 12 | < 0.1% |
| 9 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 9 | < 0.1% |
| 8 | 12 | < 0.1% |
| 7 | 86 | < 0.1% |
| 6 | 506 | < 0.1% |
| 5 | 3411 | 0.2% |
| 4 | 17689 | 0.9% |
| 3 | 81764 | 4.1% |
| 2 | 279761 | |
| 1 | 501692 | |
| 0 | 508239 |
Vehicle Age
Real number (ℝ)
Zeros 
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.5706896 |
| Minimum | 0 |
|---|---|
| Maximum | 19 |
| Zeros | 102232 |
| Zeros (%) | 5.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 130.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5 |
| median | 10 |
| Q3 | 15 |
| 95-th percentile | 19 |
| Maximum | 19 |
| Range | 19 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 5.7745923 |
|---|---|
| Coefficient of variation (CV) | 0.6033622 |
| Kurtosis | -1.2062471 |
| Mean | 9.5706896 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.020204736 |
| Sum | 19141293 |
| Variance | 33.345917 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 17 | 103983 | 5.2% |
| 11 | 103039 | 5.2% |
| 0 | 102232 | 5.1% |
| 18 | 102134 | 5.1% |
| 10 | 101852 | 5.1% |
| 14 | 101642 | 5.1% |
| 15 | 101222 | 5.1% |
| 19 | 100979 | 5.0% |
| 12 | 100863 | 5.0% |
| 16 | 100642 | 5.0% |
| Other values (10) | 981403 |
| Value | Count | Frequency (%) |
| 0 | 102232 | |
| 1 | 95397 | |
| 2 | 99865 | |
| 3 | 98619 | |
| 4 | 97111 | |
| 5 | 99266 | |
| 6 | 96650 | |
| 7 | 99200 | |
| 8 | 97307 | |
| 9 | 99921 |
| Value | Count | Frequency (%) |
| 19 | 100979 | |
| 18 | 102134 | |
| 17 | 103983 | |
| 16 | 100642 | |
| 15 | 101222 | |
| 14 | 101642 | |
| 13 | 98067 | |
| 12 | 100863 | |
| 11 | 103039 | |
| 10 | 101852 |
Credit Score
Real number (ℝ)
Missing 
| Distinct | 550 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 229333 |
| Missing (%) | 11.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 592.91651 |
| Minimum | 300 |
|---|---|
| Maximum | 849 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 130.7 MiB |
Quantile statistics
| Minimum | 300 |
|---|---|
| 5-th percentile | 341 |
| Q1 | 468 |
| median | 595 |
| Q3 | 721 |
| 95-th percentile | 822 |
| Maximum | 849 |
| Range | 549 |
| Interquartile range (IQR) | 253 |
Descriptive statistics
| Standard deviation | 150.03571 |
|---|---|
| Coefficient of variation (CV) | 0.25304694 |
| Kurtosis | -1.0906138 |
| Mean | 592.91651 |
| Median Absolute Deviation (MAD) | 127 |
| Skewness | -0.11368412 |
| Sum | 1.0498577 × 109 |
| Variance | 22510.714 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 431 | 7142 | 0.4% |
| 434 | 7111 | 0.4% |
| 713 | 6698 | 0.3% |
| 757 | 6668 | 0.3% |
| 437 | 6458 | 0.3% |
| 613 | 6288 | 0.3% |
| 584 | 6275 | 0.3% |
| 658 | 6212 | 0.3% |
| 607 | 6199 | 0.3% |
| 734 | 6193 | 0.3% |
| Other values (540) | 1705423 | |
| (Missing) | 229333 | 11.5% |
| Value | Count | Frequency (%) |
| 300 | 1459 | |
| 301 | 2587 | |
| 302 | 1993 | |
| 303 | 1805 | |
| 304 | 1433 | |
| 305 | 1162 | 0.1% |
| 306 | 1433 | |
| 307 | 2145 | |
| 308 | 3044 | |
| 309 | 2217 |
| Value | Count | Frequency (%) |
| 849 | 2948 | |
| 848 | 3733 | |
| 847 | 3441 | |
| 846 | 3072 | |
| 845 | 2847 | |
| 844 | 3275 | |
| 843 | 3499 | |
| 842 | 3152 | |
| 841 | 3155 | |
| 840 | 1309 | 0.1% |
Insurance Duration
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.018511 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 130.7 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 5 |
| Q3 | 7 |
| 95-th percentile | 9 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.5941017 |
|---|---|
| Coefficient of variation (CV) | 0.51690665 |
| Kurtosis | -1.2371694 |
| Mean | 5.018511 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.0080417665 |
| Sum | 10037007 |
| Variance | 6.7293638 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 229966 | |
| 1 | 224532 | |
| 8 | 222790 | |
| 7 | 222513 | |
| 5 | 221016 | |
| 3 | 220260 | |
| 4 | 220181 | |
| 6 | 219702 | |
| 2 | 219037 | |
| (Missing) | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 224532 | |
| 2 | 219037 | |
| 3 | 220260 | |
| 4 | 220181 | |
| 5 | 221016 | |
| 6 | 219702 | |
| 7 | 222513 | |
| 8 | 222790 | |
| 9 | 229966 |
| Value | Count | Frequency (%) |
| 9 | 229966 | |
| 8 | 222790 | |
| 7 | 222513 | |
| 6 | 219702 | |
| 5 | 221016 | |
| 4 | 220181 | |
| 3 | 220260 | |
| 2 | 219037 | |
| 1 | 224532 |
| Distinct | 173790 |
|---|---|
| Distinct (%) | 8.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 130.7 MiB |
| Minimum | 2019-08-17 15:21:39.080371 |
|---|---|
| Maximum | 2024-08-15 15:21:39.287115 |
Customer Feedback
Categorical
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 130100 |
| Missing (%) | 6.5% |
| Memory size | 233.0 MiB |
| Average | |
|---|---|
| Poor | |
| Good |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 5.0093406 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Poor |
|---|---|
| 2nd row | Average |
| 3rd row | Good |
| 4th row | Poor |
| 5th row | Poor |
Common Values
| Value | Count | Frequency (%) |
| Average | 629122 | |
| Poor | 625952 | |
| Good | 614826 | |
| (Missing) | 130100 | 6.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| average | 629122 | |
| poor | 625952 | |
| good | 614826 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2481556 | |
| e | 1258244 | |
| r | 1255074 | |
| A | 629122 | 6.7% |
| v | 629122 | 6.7% |
| a | 629122 | 6.7% |
| g | 629122 | 6.7% |
| P | 625952 | 6.7% |
| G | 614826 | 6.6% |
| d | 614826 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9366966 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 2481556 | |
| e | 1258244 | |
| r | 1255074 | |
| A | 629122 | 6.7% |
| v | 629122 | 6.7% |
| a | 629122 | 6.7% |
| g | 629122 | 6.7% |
| P | 625952 | 6.7% |
| G | 614826 | 6.6% |
| d | 614826 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 9366966 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 2481556 | |
| e | 1258244 | |
| r | 1255074 | |
| A | 629122 | 6.7% |
| v | 629122 | 6.7% |
| a | 629122 | 6.7% |
| g | 629122 | 6.7% |
| P | 625952 | 6.7% |
| G | 614826 | 6.6% |
| d | 614826 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 9366966 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 2481556 | |
| e | 1258244 | |
| r | 1255074 | |
| A | 629122 | 6.7% |
| v | 629122 | 6.7% |
| a | 629122 | 6.7% |
| g | 629122 | 6.7% |
| P | 625952 | 6.7% |
| G | 614826 | 6.6% |
| d | 614826 | 6.6% |
Smoking Status
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 117.3 MiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 1003732 | |
| False | 996268 |
Exercise Frequency
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 235.6 MiB |
| Weekly | |
|---|---|
| Rarely | |
| Monthly | |
| Daily |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 6.0035435 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Weekly |
|---|---|
| 2nd row | Monthly |
| 3rd row | Weekly |
| 4th row | Daily |
| 5th row | Weekly |
Common Values
| Value | Count | Frequency (%) |
| Weekly | 510693 | |
| Rarely | 499934 | |
| Monthly | 498230 | |
| Daily | 491143 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| weekly | 510693 | |
| rarely | 499934 | |
| monthly | 498230 | |
| daily | 491143 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 2000000 | |
| y | 2000000 | |
| e | 1521320 | |
| a | 991077 | |
| W | 510693 | 4.3% |
| k | 510693 | 4.3% |
| R | 499934 | 4.2% |
| r | 499934 | 4.2% |
| M | 498230 | 4.1% |
| o | 498230 | 4.1% |
| Other values (5) | 2476976 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12007087 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 2000000 | |
| y | 2000000 | |
| e | 1521320 | |
| a | 991077 | |
| W | 510693 | 4.3% |
| k | 510693 | 4.3% |
| R | 499934 | 4.2% |
| r | 499934 | 4.2% |
| M | 498230 | 4.1% |
| o | 498230 | 4.1% |
| Other values (5) | 2476976 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12007087 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 2000000 | |
| y | 2000000 | |
| e | 1521320 | |
| a | 991077 | |
| W | 510693 | 4.3% |
| k | 510693 | 4.3% |
| R | 499934 | 4.2% |
| r | 499934 | 4.2% |
| M | 498230 | 4.1% |
| o | 498230 | 4.1% |
| Other values (5) | 2476976 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12007087 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 2000000 | |
| y | 2000000 | |
| e | 1521320 | |
| a | 991077 | |
| W | 510693 | 4.3% |
| k | 510693 | 4.3% |
| R | 499934 | 4.2% |
| r | 499934 | 4.2% |
| M | 498230 | 4.1% |
| o | 498230 | 4.1% |
| Other values (5) | 2476976 |
Property Type
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 236.2 MiB |
| House | |
|---|---|
| Condo | |
| Apartment |
Length
| Max length | 9 |
|---|---|
| Median length | 5 |
| Mean length | 6.332044 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | House |
|---|---|
| 2nd row | House |
| 3rd row | House |
| 4th row | Apartment |
| 5th row | House |
Common Values
| Value | Count | Frequency (%) |
| House | 667500 | |
| Condo | 666478 | |
| Apartment | 666022 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| house | 667500 | |
| condo | 666478 | |
| apartment | 666022 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2000456 | |
| e | 1333522 | |
| n | 1332500 | |
| t | 1332044 | |
| H | 667500 | 5.3% |
| u | 667500 | 5.3% |
| s | 667500 | 5.3% |
| C | 666478 | 5.3% |
| d | 666478 | 5.3% |
| A | 666022 | 5.3% |
| Other values (4) | 2664088 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12664088 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 2000456 | |
| e | 1333522 | |
| n | 1332500 | |
| t | 1332044 | |
| H | 667500 | 5.3% |
| u | 667500 | 5.3% |
| s | 667500 | 5.3% |
| C | 666478 | 5.3% |
| d | 666478 | 5.3% |
| A | 666022 | 5.3% |
| Other values (4) | 2664088 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12664088 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 2000456 | |
| e | 1333522 | |
| n | 1332500 | |
| t | 1332044 | |
| H | 667500 | 5.3% |
| u | 667500 | 5.3% |
| s | 667500 | 5.3% |
| C | 666478 | 5.3% |
| d | 666478 | 5.3% |
| A | 666022 | 5.3% |
| Other values (4) | 2664088 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12664088 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 2000456 | |
| e | 1333522 | |
| n | 1332500 | |
| t | 1332044 | |
| H | 667500 | 5.3% |
| u | 667500 | 5.3% |
| s | 667500 | 5.3% |
| C | 666478 | 5.3% |
| d | 666478 | 5.3% |
| A | 666022 | 5.3% |
| Other values (4) | 2664088 |
Premium Amount
Real number (ℝ)
Missing 
| Distinct | 4794 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 800000 |
| Missing (%) | 40.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1102.5448 |
| Minimum | 20 |
|---|---|
| Maximum | 4999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 130.7 MiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 49 |
| Q1 | 514 |
| median | 872 |
| Q3 | 1509 |
| 95-th percentile | 2869 |
| Maximum | 4999 |
| Range | 4979 |
| Interquartile range (IQR) | 995 |
Descriptive statistics
| Standard deviation | 864.99886 |
|---|---|
| Coefficient of variation (CV) | 0.78454757 |
| Kurtosis | 1.5185856 |
| Mean | 1102.5448 |
| Median Absolute Deviation (MAD) | 449 |
| Skewness | 1.2409155 |
| Sum | 1.3230538 × 109 |
| Variance | 748223.03 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 25 | 4268 | 0.2% |
| 24 | 3901 | 0.2% |
| 20 | 3849 | 0.2% |
| 23 | 3524 | 0.2% |
| 28 | 3418 | 0.2% |
| 26 | 3375 | 0.2% |
| 48 | 3307 | 0.2% |
| 29 | 3139 | 0.2% |
| 100 | 3125 | 0.2% |
| 27 | 3074 | 0.2% |
| Other values (4784) | 1165020 | |
| (Missing) | 800000 |
| Value | Count | Frequency (%) |
| 20 | 3849 | |
| 21 | 362 | < 0.1% |
| 22 | 1698 | 0.1% |
| 23 | 3524 | |
| 24 | 3901 | |
| 25 | 4268 | |
| 26 | 3375 | |
| 27 | 3074 | |
| 28 | 3418 | |
| 29 | 3139 |
| Value | Count | Frequency (%) |
| 4999 | 1 | < 0.1% |
| 4997 | 2 | < 0.1% |
| 4996 | 1 | < 0.1% |
| 4994 | 1 | < 0.1% |
| 4992 | 1 | < 0.1% |
| 4991 | 1 | < 0.1% |
| 4988 | 18 | |
| 4987 | 5 | < 0.1% |
| 4986 | 3 | < 0.1% |
| 4985 | 2 | < 0.1% |
Interactions
Correlations
| Age | Annual Income | Credit Score | Customer Feedback | Education Level | Exercise Frequency | Gender | Health Score | Insurance Duration | Location | Marital Status | Number of Dependents | Occupation | Policy Type | Premium Amount | Previous Claims | Property Type | Smoking Status | Vehicle Age | id | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Age | 1.000 | 0.000 | 0.002 | 0.000 | 0.000 | 0.001 | 0.000 | -0.000 | -0.001 | 0.000 | 0.002 | 0.000 | 0.001 | 0.000 | -0.002 | 0.001 | 0.001 | 0.000 | -0.002 | -0.000 |
| Annual Income | 0.000 | 1.000 | -0.151 | 0.002 | 0.002 | 0.003 | 0.000 | 0.013 | -0.001 | 0.000 | 0.001 | 0.002 | 0.001 | 0.002 | -0.061 | -0.000 | 0.002 | 0.000 | -0.001 | 0.001 |
| Credit Score | 0.002 | -0.151 | 1.000 | 0.001 | 0.003 | 0.000 | 0.002 | 0.010 | 0.000 | 0.002 | 0.003 | 0.004 | 0.003 | 0.001 | -0.044 | 0.038 | 0.002 | 0.002 | -0.000 | 0.001 |
| Customer Feedback | 0.000 | 0.002 | 0.001 | 1.000 | 0.001 | 0.003 | 0.000 | 0.004 | 0.001 | 0.001 | 0.001 | 0.000 | 0.002 | 0.000 | 0.001 | 0.003 | 0.002 | 0.000 | 0.000 | 0.000 |
| Education Level | 0.000 | 0.002 | 0.003 | 0.001 | 1.000 | 0.001 | 0.001 | 0.005 | 0.001 | 0.002 | 0.001 | 0.001 | 0.001 | 0.001 | 0.002 | 0.002 | 0.003 | 0.000 | 0.000 | 0.000 |
| Exercise Frequency | 0.001 | 0.003 | 0.000 | 0.003 | 0.001 | 1.000 | 0.001 | 0.004 | 0.001 | 0.001 | 0.001 | 0.001 | 0.002 | 0.002 | 0.001 | 0.001 | 0.001 | 0.000 | 0.002 | 0.001 |
| Gender | 0.000 | 0.000 | 0.002 | 0.000 | 0.001 | 0.001 | 1.000 | 0.006 | 0.001 | 0.001 | 0.002 | 0.000 | 0.000 | 0.001 | 0.002 | 0.000 | 0.001 | 0.003 | 0.001 | 0.001 |
| Health Score | -0.000 | 0.013 | 0.010 | 0.004 | 0.005 | 0.004 | 0.006 | 1.000 | 0.002 | 0.005 | 0.004 | 0.005 | 0.005 | 0.002 | 0.016 | 0.003 | 0.000 | 0.003 | -0.001 | 0.000 |
| Insurance Duration | -0.001 | -0.001 | 0.000 | 0.001 | 0.001 | 0.001 | 0.001 | 0.002 | 1.000 | 0.002 | 0.002 | 0.002 | 0.003 | 0.000 | -0.000 | 0.002 | 0.002 | 0.001 | 0.003 | -0.000 |
| Location | 0.000 | 0.000 | 0.002 | 0.001 | 0.002 | 0.001 | 0.001 | 0.005 | 0.002 | 1.000 | 0.002 | 0.001 | 0.001 | 0.001 | 0.002 | 0.000 | 0.000 | 0.001 | 0.001 | 0.002 |
| Marital Status | 0.002 | 0.001 | 0.003 | 0.001 | 0.001 | 0.001 | 0.002 | 0.004 | 0.002 | 0.002 | 1.000 | 0.001 | 0.002 | 0.000 | 0.000 | 0.003 | 0.002 | 0.002 | 0.001 | 0.000 |
| Number of Dependents | 0.000 | 0.002 | 0.004 | 0.000 | 0.001 | 0.001 | 0.000 | 0.005 | 0.002 | 0.001 | 0.001 | 1.000 | 0.000 | 0.000 | 0.004 | 0.005 | 0.002 | 0.001 | 0.002 | 0.001 |
| Occupation | 0.001 | 0.001 | 0.003 | 0.002 | 0.001 | 0.002 | 0.000 | 0.005 | 0.003 | 0.001 | 0.002 | 0.000 | 1.000 | 0.001 | 0.004 | 0.001 | 0.003 | 0.000 | 0.000 | 0.001 |
| Policy Type | 0.000 | 0.002 | 0.001 | 0.000 | 0.001 | 0.002 | 0.001 | 0.002 | 0.000 | 0.001 | 0.000 | 0.000 | 0.001 | 1.000 | 0.000 | 0.002 | 0.002 | 0.001 | 0.000 | 0.000 |
| Premium Amount | -0.002 | -0.061 | -0.044 | 0.001 | 0.002 | 0.001 | 0.002 | 0.016 | -0.000 | 0.002 | 0.000 | 0.004 | 0.004 | 0.000 | 1.000 | 0.045 | 0.002 | 0.003 | 0.001 | 0.001 |
| Previous Claims | 0.001 | -0.000 | 0.038 | 0.003 | 0.002 | 0.001 | 0.000 | 0.003 | 0.002 | 0.000 | 0.003 | 0.005 | 0.001 | 0.002 | 0.045 | 1.000 | 0.002 | 0.001 | -0.002 | 0.001 |
| Property Type | 0.001 | 0.002 | 0.002 | 0.002 | 0.003 | 0.001 | 0.001 | 0.000 | 0.002 | 0.000 | 0.002 | 0.002 | 0.003 | 0.002 | 0.002 | 0.002 | 1.000 | 0.001 | 0.003 | 0.000 |
| Smoking Status | 0.000 | 0.000 | 0.002 | 0.000 | 0.000 | 0.000 | 0.003 | 0.003 | 0.001 | 0.001 | 0.002 | 0.001 | 0.000 | 0.001 | 0.003 | 0.001 | 0.001 | 1.000 | 0.002 | 0.000 |
| Vehicle Age | -0.002 | -0.001 | -0.000 | 0.000 | 0.000 | 0.002 | 0.001 | -0.001 | 0.003 | 0.001 | 0.001 | 0.002 | 0.000 | 0.000 | 0.001 | -0.002 | 0.003 | 0.002 | 1.000 | -0.000 |
| id | -0.000 | 0.001 | 0.001 | 0.000 | 0.000 | 0.001 | 0.001 | 0.000 | -0.000 | 0.002 | 0.000 | 0.001 | 0.001 | 0.000 | 0.001 | 0.001 | 0.000 | 0.000 | -0.000 | 1.000 |
Missing values
Sample
| id | Age | Gender | Annual Income | Marital Status | Number of Dependents | Education Level | Occupation | Health Score | Location | Policy Type | Previous Claims | Vehicle Age | Credit Score | Insurance Duration | Policy Start Date | Customer Feedback | Smoking Status | Exercise Frequency | Property Type | Premium Amount | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| train | 0 | 0 | 19.0 | Female | 10049.0 | Married | 1.0 | Bachelor's | Self-Employed | 22.598761 | Urban | Premium | 2.0 | 17.0 | 372.0 | 5.0 | 2023-12-23 15:21:39.134960 | Poor | No | Weekly | House | 2869.0 |
| 1 | 1 | 39.0 | Female | 31678.0 | Divorced | 3.0 | Master's | NaN | 15.569731 | Rural | Comprehensive | 1.0 | 12.0 | 694.0 | 2.0 | 2023-06-12 15:21:39.111551 | Average | Yes | Monthly | House | 1483.0 | |
| 2 | 2 | 23.0 | Male | 25602.0 | Divorced | 3.0 | High School | Self-Employed | 47.177549 | Suburban | Premium | 1.0 | 14.0 | NaN | 3.0 | 2023-09-30 15:21:39.221386 | Good | Yes | Weekly | House | 567.0 | |
| 3 | 3 | 21.0 | Male | 141855.0 | Married | 2.0 | Bachelor's | NaN | 10.938144 | Rural | Basic | 1.0 | 0.0 | 367.0 | 1.0 | 2024-06-12 15:21:39.226954 | Poor | Yes | Daily | Apartment | 765.0 | |
| 4 | 4 | 21.0 | Male | 39651.0 | Single | 1.0 | Bachelor's | Self-Employed | 20.376094 | Rural | Premium | 0.0 | 8.0 | 598.0 | 4.0 | 2021-12-01 15:21:39.252145 | Poor | Yes | Weekly | House | 2022.0 | |
| 5 | 5 | 29.0 | Male | 45963.0 | Married | 1.0 | Bachelor's | NaN | 33.053198 | Urban | Premium | 2.0 | 4.0 | 614.0 | 5.0 | 2022-05-20 15:21:39.207847 | Average | No | Weekly | House | 3202.0 | |
| 6 | 6 | 41.0 | Male | 40336.0 | Married | 0.0 | PhD | NaN | NaN | Rural | Basic | 2.0 | 8.0 | 807.0 | 6.0 | 2020-02-21 15:21:39.219432 | Poor | No | Weekly | House | 439.0 | |
| 7 | 7 | 48.0 | Female | 127237.0 | Divorced | 2.0 | High School | Employed | 5.769783 | Suburban | Comprehensive | 1.0 | 11.0 | 398.0 | 5.0 | 2022-08-08 15:21:39.181605 | Average | No | Rarely | Condo | 111.0 | |
| 8 | 8 | 21.0 | Male | 1733.0 | Divorced | 3.0 | Bachelor's | NaN | 17.869551 | Urban | Premium | 1.0 | 10.0 | 685.0 | 8.0 | 2020-12-14 15:21:39.198406 | Average | No | Monthly | Condo | 213.0 | |
| 9 | 9 | 44.0 | Male | 52447.0 | Married | 2.0 | Master's | Employed | 20.473718 | Urban | Comprehensive | 1.0 | 9.0 | 635.0 | 3.0 | 2020-08-02 15:21:39.144722 | Poor | No | Daily | Condo | 64.0 |
| id | Age | Gender | Annual Income | Marital Status | Number of Dependents | Education Level | Occupation | Health Score | Location | Policy Type | Previous Claims | Vehicle Age | Credit Score | Insurance Duration | Policy Start Date | Customer Feedback | Smoking Status | Exercise Frequency | Property Type | Premium Amount | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| test | 799990 | 1999990 | 25.0 | Male | 33991.0 | Divorced | 2.0 | Master's | NaN | 5.081818 | Suburban | Basic | 0.0 | 12.0 | NaN | 4.0 | 2023-10-03 15:21:39.102694 | Average | Yes | Monthly | Condo | NaN |
| 799991 | 1999991 | 26.0 | Female | 90883.0 | Divorced | 2.0 | High School | NaN | 11.275420 | Rural | Comprehensive | 0.0 | 19.0 | 494.0 | 5.0 | 2021-03-14 15:21:39.170099 | Poor | No | Monthly | Condo | NaN | |
| 799992 | 1999992 | 33.0 | Female | 788.0 | Married | 1.0 | Bachelor's | NaN | 47.921197 | Urban | Premium | 0.0 | 18.0 | 722.0 | 5.0 | 2024-05-28 15:21:39.123711 | Average | No | Monthly | House | NaN | |
| 799993 | 1999993 | 52.0 | Female | 25426.0 | Divorced | 4.0 | Bachelor's | Self-Employed | 39.792397 | Suburban | Comprehensive | NaN | 12.0 | 702.0 | 1.0 | 2019-08-21 15:21:39.087123 | Poor | Yes | Daily | Condo | NaN | |
| 799994 | 1999994 | 23.0 | Female | 71758.0 | Single | 3.0 | PhD | Self-Employed | 22.837951 | Suburban | Basic | 2.0 | 5.0 | 452.0 | 6.0 | 2020-06-08 15:21:39.256696 | Good | Yes | Monthly | Condo | NaN | |
| 799995 | 1999995 | 50.0 | Female | 38782.0 | Married | 1.0 | Bachelor's | NaN | 14.498639 | Rural | Premium | NaN | 8.0 | 309.0 | 2.0 | 2021-07-09 15:21:39.184157 | Average | Yes | Daily | Condo | NaN | |
| 799996 | 1999996 | NaN | Female | 73462.0 | Single | 0.0 | Master's | NaN | 8.145748 | Rural | Basic | 2.0 | 0.0 | NaN | 2.0 | 2023-03-28 15:21:39.250151 | Good | No | Daily | Apartment | NaN | |
| 799997 | 1999997 | 26.0 | Female | 35178.0 | Single | 0.0 | Master's | Employed | 6.636583 | Urban | Comprehensive | NaN | 10.0 | NaN | 6.0 | 2019-09-30 15:21:39.132191 | Poor | No | Monthly | Apartment | NaN | |
| 799998 | 1999998 | 34.0 | Female | 45661.0 | Single | 3.0 | Master's | NaN | 15.937248 | Urban | Premium | 2.0 | 17.0 | 467.0 | 7.0 | 2022-05-09 15:21:39.253660 | Average | No | Weekly | Condo | NaN | |
| 799999 | 1999999 | 25.0 | Male | 24843.0 | Divorced | 3.0 | High School | NaN | 24.893939 | Suburban | Comprehensive | NaN | 15.0 | NaN | 8.0 | 2021-05-18 15:21:39.108562 | Good | No | Rarely | House | NaN |